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Introduction 

A fundamental  problem  in  the  study  of  picture  perception 
simply  involves  explaining  how  two-dimensional  pictures  and  photo- 
graphs are  capable  of  producing  accurate  impressions  of  three- 
dimensional  spatial  layouts.  In  part,  this  representational  quality 
is  based  on  similarities  between  the  optic  array  projected  from 
the  picture  and  that  from  the  real  scene.  Under  "ideal"  conditions 
a picture  can  present  to  the  eye  a near  simulation  of  the  scene 
it  represents.  Given  a near  identity  of  the  two  arrays,  any  theory 
can  explain  picture  perception. 

But  this  ideal  case  is  rarely  achieved  or  even  closely  approx- 
imated. As  Evans  (1960),  Pirenne  (1970),  and  others  have  noted,  the 
pictorial  array  may  differ  from  the  environmental  array  in  many 
respects:  the  presence  of  texture  from  the  picture  surface,  a 
narrowing  of  the  range  of  colors  and  luminances,  the  presence  of 
borders,  and  so  forth.  Furthermore  a picture  is  rarely  viewed 
from  the  correct  point  of  observation.  Consequently,  there  may 
be  striking  geometrical  distortions  in  the  array  that  is  projected 
to  the  observer's  eye. 

To  the  extent  that  perception  of  spatial  relations  in  pictures 
depends  on  detailed  geometric  isomorphism,  we  should  expect  that 
viewing  a picture  from  an  incorrect  location  would  affect  the 
perception  of  layout.  Yet  ordinary  experience  sugqests  that  such 
effects  are  weak  or  non-existent  under  some  conditions.  As  we  walk 
past  a painting  or  photograph,  we  perceive,  apparently  accurately, 
what  is  depicted  in  the  picture.  The  existence  of  such  a constancy 
phenomenon  has  suggested  to  some  (e.g.,  Pirenne,  1970)  that  the 
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perception  of  pictures  must  involve  a compensation  process  that 
enables  viewers  to  discount  the  effects  of  projective  transforma- 
tions on  the  depicted  space.  If  such  constancy  processes  exist,  they 
must  work  only  within  certain  circumscribed  limits.  A simple  dem- 
onstration that  such  limits  exist  can  be  seen  in  anamorphic  art. 

Even  the  simplest  projective  transformation  used  in  anamorphic  art 
can  not  be  discounted  or  compensated  for  when  the  picture  is  viewed 
from  the  frontal -paral lei . Perception  of  the  object  or  scene 
depicted  in  the  anamorphic  painting  occurs  only  when  the  painting 
is  viewed  so  that  the  projection  to  the  eye  is  similar  to  the  pro- 
jection from  the  real  object,  i.e.,  when  geometric  similarities 
exist  between  the  object  and  its  representation. 

Because  the  precise  relationships  between  viewing  point  dis- 
locations, geometric  transformations,  and  perceptual  accuracy  is 
unknown,  determination  of  the  correct  viewing  point  for  a display 
and  of  the  effects  of  displacement  is  crucial  for  a further  under- 
standing of  picture  perception.  Clearly  any  discussion  of  the  way 
in  which  the  visual  system  treats  pictures,  as  well  as  any  investi- 
gation of  "pictorial  compensation"  must  be  based  on  knowledge  of  the 
transformations  induced  by  viewing  point  displacement.  In  spite  of 
this,  there  have  been  relatively  few  attempts  to  specify  or  control 
such  transformations. 

In  the  present  chapter,  we  discuss  the  geometric  effects  of 
changing  the  viewing  point  of  a picture,  and  consider  the  theoretical 
and  empirical  status  of  the  concept  of  "pictorial  constancy."  For 
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the  purposes  of  this  discussion  we  adopt  the  convention  of  speaking 
in  terms  of  photography  and  photographic  images;  of  course,  identical 
considerations  hold  for  any  geometrically  representative  display 
that  has  a single  center  of  projection,  such  as  TV  or  radar  images, 
line  drawings,  computer  graphics,  etc.  We  distinguish  between  the 
optic  arrays  produced  by  the  picture  and  by  the  environment  by 
referring  to  the  pictorial  and  environmental  arrays.  For  a picture 
viewed  from  a given  point,  the  spatial  layout  that  could  have  gen- 
erated an  array  equivalent  to  that  pictorial  array  will  be  called 
the  virtual  space.  Thus  for  each  viewing  point,  there  exists  a 
corresponding  virtual  space.  By  definition,  then,  the  "correct" 
viewing  point  for  a picture  is  one  in  which  the  virtual  space  is 
identical  with  the  geographic  space  that  originally  generated  the 
picture,  i.e.,  when  the  pictorial  and  environmental  arrays  are  iso- 
morphic. 

To  obtain  this  identity  in  viewing  a photograph,  the  relation- 
ship between  the  eye  and  the  photograph  must  preserve  the  relation- 
ships that  exist  between  the  lens  node  and  the  film  plane.  With  a 
rigid  (box)  camera,  the  lens  axis  is  centered  on  a line  perpendicular 
to  the  center  of  the  film  plane,  so  the  correct  viewing  point  must 
be  along  a line  perpendicular  to  the  center  of  the  photograph.  The 
distance  of  the  correct  viewing  point  along  this  line  is  equal  to 
the  actual  focal  length  of  the  lens  multiplied  by  the  degree  of 
enlargement  of  the  photograph.  Thus,  if  the  focal  length  of  the 
lens  when  the  picture  was  taken  was  70  mm,  and  the  negative  is 
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enlarged  8X,  then  the  correct  viewing  distance  is  560  mm.  Mote 
that  the  correct  distance  is  jointly  a function  of  the  focal  length 
and  the  degree  of  enlargement.  Viewing  distance  for  a particular 
photograph  is  too  close  or  too  far  away  only  with  reference  to  lens 
and  enlargement.  An  implication  of  this  fact  is  that  geometrically, 
transformations  are  not  solely  due  to  the  use  of  particular  lenses, 
but  to  the  combination  of  lenses  and  viewing  point.  Thus,  the 
so-called  telephoto  effect  is  not  the  result  of  using  a telephoto 
lens,  but  rather  of  viewing  a photograph  from  an  incorrect  (too 
near)  point  given  a particular  lens. 

Similar  considerations  hold  for  determining  the  correct  view 
point  for  a picture  taken  with  a flexible  (view)  camera.  All 
spatial  relations  between  the  lens  axis  and  the  fiirn  plane  must 
be  identical  to  the  relations  between  the  line  of  sight  and  the 
photograph.  Thus,  a 3 cm  rise  of  the  film  plane  puts  the  correct 
viewing  point  3 cm  (times  the  degree  of  enlargement)  above  the 
center  of  the  photograph.  The  pictorial  and  environmental  arrays 
are  identical  if  the  distance  and  orientation  of  the  photograph  rela- 
tive to  a line  from  the  eye  to  the  center  of  the  photo  are  identical 
to  the  distance  and  orientation  of  the  film  plane  relative  to  the 
line  from  the  lens  node  to  the  center  of  the  negative. 

If  these  conditions  are  not  maintained,  then  the  pictorial 
and  environmental  arrays  will  differ,  and  the  virtual  space  pro- 
jected from  the  picture  will  differ  from  the  environmental  space. 
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One  way  to  define  the  geometric  distortions  that  result  from 
changing  the  viewing  point  is  to  describe  the  virtual  space  that 
is  produced.  To  demonstrate  this  point,  let  us  consider  the  object 
in  Figure  1.  If  this  drawing  is  viewed  from  the  correct  point 
along  a line  perpendicular  to  the  center  of  the  photo,  then  the 
virtual  object  for  the  pictorial  array  is  a cube.  If  the  photo  is 
viewed  from  another  point,  the  virtual  object  for  that  projection 
will  not  correspond  to  a cube,  but  to  some  other  object.  If  we 
can  describe  this  other  object,  then  we  will  have  described  the 
effects  of  the  geometric  transformation. 

This  section  of  the  report  will  examine  the  geometrical  basis  of 
pictorial  space  perception  via  linear  perspective  representations.  We 
will  consider  photographs,  drawings,  paintings,  or  other  representa- 
tions which  can  be  analyzed  by  the  principles  of  linear  perspective, 
describe  the  rules  by  which  the  virtual  space  specified  by  such  a 
picture  can  be  determined,  and  show  how  the  geometry  of  the  virtual 
space  is  affected  by  the  location  of  the  point  of  observation.  In 
particular,  the  way  that  "incorrect"  points  of  observation  produce 
specific  transformations  of  the  geometry  of  the  virtual  space  will 
be  explained.  Although  our  analyses  will  be  developed  for  linear 
perspective,  the  same  general  principles,  and  the  same  conclu- 
sions, apply  to  pictorial  representations  of  spatial  layout  in 
general,  as  we  have  shown  elsewhere  (Farber  and  Rosinski,  1978). 
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Decoding  Spatial  Layout  through  Linear  Perspective 

The  rules  of  linear  perspective  can  be  used  to  represent 
a desired  spatial  layout.  A picture  can  be  constructed  using  these 
rules,  so  that  it  projects  the  same  array  to  some  point  as  a real 
scene.  This  is  a case  of  using  linear  perspective  for  the  synthesis 
of  a spatial  representation.  The  same  principles  can  be  used  to 
analyze  a given  representation,  i.e.,  to  determine  the  corresponding 
spatial  layout,  given  the  picture  and  point  of  observation.  It 
is  possible  to  determine  the  three  dimensional  shape,  the  relative 
sizes,  and  the  orientations  of  objects  in  a picture  if  the  scene 
lends  itself  to  linear  perspective  analysis.  We  will  first  review 
some  rules  for  decoding  layout  from  linear  perspective,  and  then 
consider  their  implications  for  virtual  space  projected  to  "incorrect" 
points  of  observation. 

In  a linear  perspective  representation,  parallel  lines  converge 
to  a common  vanishing  point.  There  exists  a point  in  the  picture 
plane  (corresponding  to  the  "point  at  infinity"  in  space)  which  is 
common  to  all  parallel  lines.  The  direction  of  the  vanishing  point 
relative  to  the  point  of  observation  is  the  common  direction  of  all 
lines  in  the  sheaf.  Consequently,  the  direction  of  the  vanishing 
point  specifies  the  direction  or  orientation  of  the  set  of  parallel 
1 ines. 

Further,  lines  that  are  coplanar  converge  to  vanishing  points 
that  are  collinear.  The  lines  on  a plane  all  have  vanishing  points  that 
are  on  a line  in  the  picture  plane.  Thus,  there  is  a horizon  line 
corresponding  to  each  plane. 
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These  facts  are  sufficient  to  deal  with  most  cases  of  general 
interest.  Figure  2 is  the  projection  of  a simple  ground  plane,  con- 
sisting of  a regular,  rectangular  texture.  There  is  a central  vanish- 
ing point  at  V1 , the  intersection  of  one  family  of  parallel  sides.  The 
orthogonal  sides  also  project  into  lines  that  intersect  in  a vanishing 
point  V^;  but  the  vanishing  point  for  these  lines  cannot  be  represented 
in  the  figure,  since  it  is  at  infinity  in  the  plane  of  the  illustration. 
There  are  two  other  implicit  vanishing  points  which  are  of  particular 
importance  for  our  discussion:  and  are  the  vanishing  points 

for  the  diagonals  of  the  ground  lattice.  Although  the  diagonals  are 
not  directly  represented  in  the  illustration,  they  do  intersect  at 
V3  and  V4,  because  for  a regular  ground  texture,  the  diagonals  form 
two  sets  of  parallel  lines. 

In  order  to  determine  the  geometry  of  the  space  of  Figure  2,  we 
need  a scale  for  depth.  That  is,  we  need  to  know  the  depth  shape  of 
the  ground  elements;  the  ratio  of  depth  to  width.  This  requires 
that  we  specify  a point  of  observation.  We  assume  the  viewing  point 
for  Figure  2 is  at  0.  Note  that  although  we  have  represented  0 in 
the  illustration,  it  does  not  exist  in  the  actual  picture  plane. 

We  can  now  determine  the  shape  of  the  ground  elements.  This  is 
not  trivial,  since  trapezoidal  figures,  even  if  we  assume  them  to 
correspond  to  rectangles,  might  correspond  to  rectangles  of  a wide 
range  of  shapes  (d/w  ratios).  In  fact,  only  for  a specific  center  of 
projection  does  the  figure  represent  a square  ground  lattice.  This 
point  is  located  where  the  angle  between  the  observation  point  and  the 
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Figure  2 

Schematic  representation  of  the  geometric  relations  important  in  an 
illustration  of  a ground  plane  of  regular  elements.  V],  V2,  V-,,  and 
V4  are  vanishing  points  of  the  surface.  The  observation  points,  0, 

O',  and  0"  do  not  lie  in  the  picture  plane,  but  are  represented  in  this 
illustration  to  depict  angular  relations. 
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primary  vanishing  points  V1  and  (the  angle  <V]  0 V^)  is  90°; 
therefore  the  sides  of  the  ground  texture  elements  are  orthogonal, 
and  their  shape  is  rectangular.  If  the  angles  between  the  diagonals 
and  the  main  vanishing  points  ( <V ^ 0 V^  or  <V4  0 V-j ) are  45°,  each 
element  corresponds  to  a square,  rather  than  any  other  rectangle. 

Of  course,  the  absolute  size  of  the  ground  elements  is  not  specified, 
unless  ae  make  some  assumption  about  the  height  of  the  original  center 
of  projection  above  the  ground  plane,  but  the  relative  sizes  and  dis- 
tances are  fully  determined  by  the  angular  relations  among  the  vanish- 
ing points  and  the  viewing  point. 

In  a similar  fashion,  the  orientation  of  planar  surfaces  can 
be  analyzed  in  terms  of  vanishing  point  relationships  in  the  pictorial 
array.  In  the  environment,  a line  from  the  vanishing  point  of  a plane 
surface  to  the  eye  must  run  parallel  to  the  surface  in  space.  Thus, 
the  orientation  of  the  lines  from  the  eye  to  the  various  vanishing 
points  relative  to  the  gravitational  coordinate  system  is  identical 
with  the  orientation  of  the  surface  in  space.  If  a picture  is  viewed 
from  the  correct  viewing  point,  lines  from  the  eye  to  the  pictorial 
vanishing  points  have  the  same  orientation  as  the  represented  surfaces. 

Effects  of  Viewing  Point  Dislocation 

As  the  preceding  analysis  shows,  if  a picture  is  viewed  *rom  the 
correct  point  of  observation,  the  angular  relations  between  the  view- 
ing point  and  the  various  vanishi  g points  specify  the  internal  depth, 
relative  size,  shape,  and  orientation  for  depicted  layouts.  Clearly, 
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then,  if  one  changes  the  position  of  the  viewing  point  relative  to  the 
photo  these  angular  relations  are  altered;  and  the  pictured  space  is 
geometrically  transformed. 

Magnification 

If  the  same  picture  is  viewed  from  a different  point,  a different 
virtual  space  is  specified.  For  example,  if  the  center  of  projection 
were  at  O'  rather  than  0 (on  the  same  perpendicular  line,  but  closer 
to  the  picture  plane),  the  diagonal  vanishing  point  angles  would  be 
shifted.  In  particular,  if  the  distance  from  V-|  to  0 =,z,  and  if  the 
distance  V-|  O'  = z',  then  the  angle  <V^  0 is  increased  relative  to 
<Vj  0 V-| , by  a factor  depending  on  the  ratio  z'/z.  If  we  call  these 
angles  9 and  9',  then 

tan  9'  = k tan  9, 

where  k = z/z'. 

In  the  present  case,  this  amounts  to  an  increase  in  the  diagonal 
angle  (Figure  3),  and  implies  a decrease  in  the  depth/width  ratio.  Hence, 
a decrease  in  the  distance  of  the  center  of  projection  corresponds  to 
a decrease  in  the  depth  scale  of  the  pictorial  space  (cf.  Purdy,  1958). 

So  far,  we  have  been  considering  the  analysis  of  a picture,  given 
a known  center  of  projection.  But  if  we  analyze  pictorial  represen- 
tations in  terms  of  the  corresponding  virtual  space,  the  same  conclu- 
sions should  follow  for  the  effects  of  viewing  a picture  from  a point 
of  observation  displaced  along  the  perpendicular.  Viewing  a picture 
from  a point  of  observation  closer  than  the  theoretical  center  of 
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The  angular  length-width  ratio  and  the  diagonal  angles  of  a ground  element 
define  its  shape.  Because  magnification  compresses  internal  depth  it  alters 

element  shape. 
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projection  produces,  at  the  eye,  an  array  corresponding  to  a space  of 
compressed  internal  depth;  viewing  from  too  far  produces  an  array 
corresponding  to  a space  with  an  expanded  internal  depth  scale. 

Although  we  have  been  concerned  with  a single  ground  plane,  the 
same  arguments  and  analyses  generalize  to  scenes  containing  planes 
in  different  orientations,  or  containing  objects  of  different  shapes, 
as  long  as  linear  perspective  still  applies.  For  example,  consider 
Figure  4 (a  slanted  lattice  against  a textured  ground).  The  ground 
plane  is  defined  as  in  Figure  2,  but  there  is  now  another  vanishing 
line,  corresponding  to  the  horizon  of  the  slanted  checkerboard.  The 
slant  of  the  plane  is  specified  by  the  direction  if  its  horizon.  For 
the  case  illustrated  in  Figure  4,  and  for  the  point  of  observation 
indicated  by  0,  the  slant  is  about  45°.  But  for  a nearer  center  of  pro- 
jection (point  of  observation),  the  elevation  of  the  horizon  is  in- 
creased--hence,  the  specified  slant  approaches  the  frontal -paral lei . 

In  addition  to  compressing  depth  in  the  ground  plane,  moving  the  point 
of  observation  closer  to  the  picture  also  changes  the  specified  slants 
of  other  planes. 

In  fact,  all  of  the  effects  of  changing  the  distance  of  the  point 
of  observation  can  be  summarized  by  the  statement  that  the  internal 
depth  of  the  corresponding  virtual  space  is  compressed  by  a factor 
k = z/z1.  This  implies  the  changes  of  slant,  shape,  and  internal  depth. 


Shear 

The  effects  we  have  discussed  are  produced  by  varying  the  distance 
of  the  center  of  projection  or  point  of  observation.  We  now  consider 
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A lattice-like  surface  lying  on  the  ground  plane  creates  a set  of  vanishing 
points,  V i\  v,',  V,  and  V/-  These  vanishing  points  help  specify  the 
surface's  characteristics  and  its  spatial  relationship  with  the  ground.  The 
observation  point,  0,  does  not  lie  in  the  picture  plane,  but  is  represented 
in  the  illustration  to  depict  angular  relations. 
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the  effects  on  the  virtual  space  of  lateral  displacements  of  the 
point  of  observation  (parallel  to  the  picture  plane),  as  illustrated 
by  point  0"  in  Figure  2.  Once  again,  we  need  only  consider  the  effects 
on  the  angular  relations  among  the  vanishing  points  and  the  point  of 
observation.  Obviously,  moving  0 to  0"  (where  0"  is  to  the  right 
of  0)  shifts  the  direction  of  , and  consequently  the  direction  of 
the  lines  intersecting  at  V-p  But  it  also  alters  the  angle  between 
V-|  and  The  angular  direction  of  is  unaffected  by  the  lateral 
shift,  but  V-j  is  shifted  leftwards;  so  the  angle  <V-|  0 is  now 
greater  than  90°.  This  means  that  the  sides  of  the  ground  elements 
are  no  longer  orthogonal.  The  "element"  of  ground  texture  is  no 
longer  a rectangle,  but  a parallelogram.  The  effect  of  the  lateral 
displacement  of  the  point  of  observation  is  to  produce  a shear  in  the 
virtual  space.  Of  course,  not  only  ground  elements,  but  all  angular 
relations  among  non-parallel  lines  (as  long  as  they  are  not  in  the 
same  frontal  plane)  are  sheared.  In  particular,  although  frontal- 
parallel  lines  and  planes  remain  frontal -parallel  under  a lateral 
shift  (because  their  vanishing  points  are  unaffected),  the  orientation 
of  non-frontal  planes  is  angularly  shifted  opposite  to  the  direction 
of  displacement. 

In  the  general  case,  both  perpendicular  and  lateral  shifts  of  the 
point  of  observation  may  occur.  It  is  intuitively  clear  that  the 
effects  described  here  should  add  together:  viewing  from  too  near  and 
too  far  to  the  right,  for  example,  should  produce  both  a depth  compres- 
sion (due  to  the  approach  along  the  perpendicular),  and  a lateral 
shear.  We  have  described  the  quantitative  relations  in  Farber  & 
Rosinski  (1978). 
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Figure  5a 
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Finure  5b 


V 


A schematic  side  view  depicting  geometric  relations  between  a point  of 
observation,  0,  and  the  picture  plane  n.  V is  the  vanishing  point 
for  a surface  depicted  in  the  picture.  A line  drawn  from  the  point  of 
observation  to  the  vanishing  point  makes  the  same  angle  9,  with  the 
perpendicular  as  the  surface  does. 
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The  picture  plane  has  been  rotated  so  that  it  makes  an  angle,  6,  with 
the  horizontal.  The  transformed  surface  slant,  9‘,  can  be  expressed 


and  as 


cot  9'  = 


sin  9 
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h sin  6 
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sin  9. 
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and  therefore, 


sin  9‘  = sin  9 


sin  3 

+ cos  6 sin 


Rotating  the  picture  plane  affects  the  relations  among  vanishing 
points  of  surface  element  diagonals.  In  a directly  analogous  fashion, 
such  change  can  be  analyzed  as  an  index  of  shape  distortion. 

It  is  important  to  note  once  again  that  the  effects  we  are  dis- 
cussing are  "geometrical,"  in  the  sense  that  the  transformations  of 
virtual  space  are  determined  on  the  basis  of  considerations  concern- 
ing the  geometry  of  projection,  and  not  on  the  basis  of  a theory  about 
the  operation  of  the  visual  system.  We  do  not  claim  that  the  trans- 
formations of  pictorial  space  which  are  geometrically  specified  nec- 
essarily occur  in  perception.  That  is  a separate,  empirical  matter. 
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Pictorial  Space  Perception 

The  experimental  literature  on  the  effects  of  spatial  transfor- 
mations on  pictorial  space  perception  is  small.  There  is,  however, 
convincing  evidence  that,  under  some  conditions,  perceived  and  virtual 
space  are  in  correspondence.  As  Gibson  (1971)  has  argued,  pictures 
can  be  representational  whenever  monocular  arrays  from  a picture  and 
from  the  world  coincide.  This,  of  course,  corresponds  to  our  defini- 
tion for  viewing  a picture  from  the  correct  station  point.  Thus  for 
Gibson,  representational ity  implies  that  perceived  and  virtual  space 
are  isomorphic. 

Such  isomorphism  exists,  at  least  for  viewing  under  specialized 
conditions.  Schlosberg  (1941)  suggested  that  photographs  can  lead  to 
compelling  and  highly  realistic  impressions  of  spatial  layout  under 
conditions  which  restrict  the  monocular  field  of  view  to  the  photo- 
graph. Gibson  (1951)  had  observers  view  a corridor  and  a photomural 
of  that  same  corridor.  The  observer's  view  in  both  conditions  was 
through  a small  aperture  that  restricted  the  field  of  view  and  kept 
the  eye  at  the  correct  point  of  observation.  Observers  were  asked  to 
decide  which  of  the  two  corridors  was  real  and  which  was  a picture. 
Gibson  reported  that  only  60  percent  of  the  observers  were  correct. 

The  results  were  surprising  in  view  of  the  fact  that  the  two  stimulus 
situations  differed  in  overall  brightness,  contrast,  and  color. 

Since  the  virtual  space  projected  by  the  picture  was  identical  to  the 
environmental  space  only  in  terms  of  their  geometry,  difficulty  in 
discriminating  the  two  indicates  that  geometric  factors  played  a large 
role  in  the  perceptual  discrimination  of  the  two  conditions. 
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A similar  experiment  by  Smith  and  Smith  (1961)  illustrated  this 
point  more  dramatically.  In  Smith  and  Smith's  experiment,  people 
looked  through  an  eye  piece  positioned  at  the  correct  viewing  points 
and  viewed  a photograph  of  a room  containing  a target  on  the  floor. 

The  observers  were  asked  to  toss  a ball  into  the  target  under  condi- 
tions that  restricted  their  view  of  the  picture  after  the  ball  had  been 
tossed.  There  were  two  important  aspects  of  the  results  of  this  study. 
First,  the  ball  tossing  responses  of  the  observers  were  all  quite 
accurate.  In  all  cases,  the  ball  would  have  landed  on  or  near  the 
target.  The  virtual  location  of  the  target  in  the  pictorial  space, 
then,  determined  the  perceived  location  of  the  target.  More  strikingly. 
Smith  and  Smith  report  that  although  no  attempt  was  made  to  achieve 
versimilitude  (beyond  assuring  that  the  geometrical  structure  of  the 
pictorial  array  matched  that  of  the  environmental  array),  none  of 
the  observers  reported  knowing  that  he  was  lookinq  at  a Dhotoqraoh  rather 
than  at  the  actual  room.  Thus,  both  in  terms  of  qualitative  impres- 
sions as  well  as  quantitative  accuracy,  the  structure  of  the  geometric 
array  is  an  important  determiner  of  a pictoral  space  perception. 

This  fact,  however,  raises  a difficult  problem  for  an  information- 
based  theory  of  picture  perception.  If  there  is  an  isomorphism  be- 
tween perceived  space  and  virtual  space,  pictures  should  be  accurately 
representational  only  when  they  are  viewed  from  the  correct  center  of 
projection.  Each  projection  to  other  viewing  points  specifies  a 
unique  virtual  space,  and  each  viewing  point  should  give  rise  to  its 
own  unique  perceived  space. 
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perceptual  anomaly.  It  has  been  difficult  to  determine  the  nature  of 
this  mechanism,  bee;.  .se  an  analysis  of  the  distortions  of  virtual  space 
has  only  recently  been  generally  available. 

An  inability  to  compare  perceived  space  with  virtual  space  makes 
it  impossible  to  specify  precisely  the  extent  and  nature  of  percep- 
tual compensation.  Assertions  regarding  the  existence  of  compensa- 
tion may  be  in  error  if  crucial  relations  among  perceived,  virtual,  and 
environmental  spaces  are  not  determined.  Yet,  virtually  every  writer 
on  pictorial  distortion  (the  present  ones  included)  has  appealed  to 
the  reader's  intuitions.  For  example,  Haber  (1978,  p.  41)  in  discussing 
expected  perceptions  of  a distorted  pictorial  space  argues  that  "most 
picture  lookers  know  that  this  does  not  happen."  It  is  worth  pointing 
out  that  neither  such  casual  phenomenology  nor  the  more  experimental 
phenomenology  of  Pirenne  is  relevant  here.  The  fact  that  observers 
are  not  consciously  aware  of  distortions  in  virtual  space  does  not 
imply  that  the  nature  of  virtual  space  is  unregistered  by  the  visual 
system.  Further,  one's  introspections  about  the  nature  of  perceptual 
distortions  are  irrelevant.  To  comment  on  whether  a picture  seems 
distorted  is  to  assess  the  correspondence  between  virtual  and  environ- 
mental space.  A judgment  of  a distortion  of  space  implies  that  virtual 
space  is  registered  and  somehow  compared  to  environmental  space.  But, 

I cannot  judge  that  this  scene  is  distorted  unless  I know  what  it  is 
supposed  to  look  like.  This  information  is  not  available  at  the  in- 
correct viewing  point.  Logically,  one's  estimate  of  the  distortion 
present  in  virtual  space  can  not  be  accurate  unless  an  impossible  ob- 
ject results. 
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A second  kind  of  logical  problem  has  arisen  because  of  diffi- 
culty in  specifying  the  virtual  and  environmental  spaces,  and  assessing 
their  effects  on  perception.  For  example,  dislocation  of  viewing  point 
does  not  affect  the  relative  sizes  of  surfaces  lying  in  planes  parallel 
to  the  picture  plane.  Relative  sizes  in  any  virtual  space  always  coin- 
cide with  relative  sizes  in  the  environmental  space.  Consequently, 
the  perceived  relative  size  of  objects  in  photographs  is  logically 
irrelevant  to  the  problem  of  determining  the  nature  of  perceptual 
compensation.  The  concept  of  compensation  only  need  be  invoked  when 
the  virtual  space  and  environmental  space  differ.  This  makes  it  es- 
pecially important  to  determine  whether  alterations  of  virtual  space 
are  reflected  in  the  perception  of  space. 

Effects  of  Magnification 

One  of  the  clearest  demonstrations  of  the  effects  of  magnifying 
the  pictorial  array  was  provided  by  Purdy  (1960).  Purdy  had  his  obser- 
vers make  reproduction  judgments  of  the  slant  of  a textured  surface 
under  experimental  conditions  in  which  there  was  either  no  magnifica- 
tion or  a 1.5  magnification.  In  addition  to  determining  whether  the 
conditions  differed,  or  whether  the  data  deviated  from  the  predictions, 
Purdy  used  an  ingenious  procedure  to  eliminate  the  effects  of  constant 
errors.  Since  a slant  of  40.9°  (0.713  rad)  under  a 1.5  magnifica- 
tion geometrically  specifies  a surface  at  52.4°(0.914  rad),  Purdy 
had  his  observers  judge  both  the  40.9°  (0.713  rad)  surface  under  1.5 
magnification  as  well  as  52.4°  (0.914  rad)  surface  without 
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magnification.  Thus,  a direct  wi thin-subject  comparison  of  the 
perceived  slant  could  be  obtained.  The  two  magnifications  were 
created  by  maintaining  a constant  distance  from  the  observer  to  the 
display,  but  changing  the  location  of  the  center  of  projection-. 

Thus  available  information  specified  the  virtual  orientation  of  the 
surface. 

Purdy  found  a significant  difference  between  judgments  with  and 
without  magnification.  Further,  the  average  deviation  between  the 
judged  surface  slant  under  magnification  and  the  geometrically  pre- 
dicted judgment  without  magnification  was  only  1.05°  (0.018  rad). 

Thus,  there  was  a remarkably  close  correspondence  between  the  perceived 
slant  of  the  surface  and  the  virtual  slant  specified  by  the  structure 
of  the  magnified  pictorial  array. 

A similar  correspondence  has  been  found  in  other  studies.  Smith 
and  Gruber  (1958)  used  a direct  scaling  technique  to  determine  the 
effect  of  viewing  distance  on  the  perception  of  distance  in  photographs. 
On  each  trial,  observers  viewed  a corridor  and  a photomural  of  the  cor- 
ridor through  an  aperture.  The  experimenter  designated  a point  in 
the  photo,  and  the  observer  expressed  the  distance  of  that  point  as  a 
fraction  or  multiple  of  the  perceived  distance  to  the  corresponding 
point  in  the  real  corridor.  The  correct  geometric  viewing  point  for 
the  photomural  was  at  a distance  of  2.1  meters.  The  actual  viewing 
distances  used,  however,  were  1.0,  1.3,  1.6,  2.0,  2.4,  and  2.8  meters. 
Viewing  from  these  observation  points  should  result  in  compression  or 
dilation  of  the  virtual  space  depicted  in  the  photo'graph.  Smith  and 
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Gruber  found  that  the  judgments  did  differ  significantly  from  one  another 
across  viewing  distance.  In  addition,  by  reanalyzing  their  data,  we 
can  see  that  the  ratio  judgments  actually  made  in  the  various  condi- 
tions closely  matched  those  predicted  by  the  distortion  of  the  virtual 
space.  Since  k = 2.1  at  the  nearest  viewing  distance,  and  k = .75 
at  the  farthest  distance,  the  consequent  compressions  and  dilations 
of  virtual  space  differ  by  a factor  of  2.85.  In  fact,  actual  data  for 
these  conditions  differed  by  a factor  of  2.86.  Smith  and  Gruber  point 
out  that  within  any  of  the  viewing  conditions,  the  obtained  judgment 
did  not  differ  by  more  than  6 percent  from  the  predicted. 

The  magnifications  in  the  Smith  and  Gruber  study  were  induced  by 
varying  the  distance  between  the  observer  and  the  display.  However, 
viewing  was  through  a reduction  aperture  which  reduced  display  plane 
information.  Thus,  in  terms  of  both  optical  information  and  results, 
the  Purdy  study  and  the  Smith  and  Gruber  study  are  consonant. 

In  a further  experiment.  Smith  (1958a)  had  observers  view  a similar 
photomural  from  two  distances,  and  to  estimate  the  number  of  paces 
that  would  be  needed  to  go  from  their  position  to  a set  of  pipes  at 
the  end  of  the  hall.  As  one  might  expect  from  the  dilation  of  virtual 
space,  observers'  judgments  of  these  distances  varied  directly  with  the 
viewing  distance  of  the  photograph.  Hov/ever,  the  match  between  geo- 
metric predictions  and  the  actual  judgments  was  not  as  close  in  this 
study  as  in  the  previous  ones.  Given  the  two  viewing  distances  used, 
the  judgments  from  the  far  viewing  point  should  have  been  about  4.5 
times  those  from  the  near  point.  Although  the  judgments  differed  in 
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the  predicted  direction,  they  differed  only  by  a factor  of  about  2.5 
These  results  suggest  that  the  geometry  of  virtual  space  affected  the 
perception  of  distance,  but  that  other  factors  may  have  been  operating 
to  reduce  the  effect  relative  to  that  theoretically  expected. 

A similar  conclusion  can  be  drawn  from  another  study  performed 
by  Smith  (1958b).  In  this  experiment,  people  viewed  photographs  of 
a plowed  field  with  15  white  stakes  of  varying  heights  positioned 
in  the  foreground  and  with  a single  white  stake  positioned  in  the  back- 
ground. The  observers  viewed  these  photographs  at  either  75  percent 
(k  = 1.33)  or  250  percent  (k  = 0.40)  of  the  geometrically  correct 
viewing  distance  (resulting  in  a magnification  or  minification  respec- 
tively), and  made  judgments  of  the  height  and  distance  of  the  stake  in 
the  background.  The  geometrical  analysis  presented  above  indicates 
that  under  these  two  conditions,  the  virtual  space  of  the  photograph 
should  be  compressed  and  dilated.  Qualitatively,  the  results  followed 
this  pattern.  Judgments  of  distance  in  the  two  viewing  conditions 
differed  by  a factor  of  2.7,  with  the  mean  judgments  at  the  75. percent 
viewing  distance  to  be  in  underestimation  and  those  from  the  250 
percent  distance  to  be  overestimations  of  the  actual  distance  of  the 
target  object.  Thus,  these  results  generally  follow  expectations  based 
in  the  geometric  deformation  of  the  virtual  pictorial  space. 

However,  a precise  reanalysis  of  the  Smith  data  indicates  that 
the  match  between  the  geometrically  expected  results  and  the  data  is 
not  very  close  (Figure  6).  For  example,  the  average  distance  of 
the  standard  was  203  yards  (18Gm).  Under  the  viewing  conditions  used 
in  the  study,  the  geometric  compression  of  virtual  space  predicts 
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Figure  6 

Mean  judged  distance,  and  predictions  based  on  the  geometry  of 
virtual  space  for  the  physical  distances  used  in  Smith's  experiment. 


1/1.33  (times  203)  = 152  and  507  yards  (464m)  in  the  250  percent  con- 
dition, since  1/0.4  (times  203)  = 507.  In  fact,  the  actual  means  for 
these  two  conditions  were  110  (100.5m)  yards  and  297  yards  (271.5m) 
respectively.  Thus,  it  is  clear  that  there  was  considerable  under- 
estimation relative  to  the  geometrically  predicted  distances.  Secondly, 
there  are  reasons  to  suppose  that  the  effects  of  the  250  percent  mini- 
fication  condition  were  even  less  than  the  above  comparisons  suggest. 

Smith  did  not  use  a control  condition  in  which  observers  viewed 
the  same  photographs  from  the  correct  station  point.  However,  Gibson 
and  Smith  (unpublished)  did  perform  such  a study  using  the  same  photo- 
graphs later  used  by  Smith  (1958b).  Smith  himself  used  this  other 
set  of  data  as  a comparison  for  magnification  or  mini fication.  For 
an  average  distance  of  the  standard  of  203  yards  (136m),  the  mean 
distance  judgment  when  viewing  the  photograph  from  the  correct  station 
point  was  261  yards  (289m).  Thus,  when  viewing  from  the  correct  point, 
observers  overestimated  the  distance  of  the  standard.  The  virtual  space 
projected  to  the  correct  viewing  point,  or  with  250  percent  minifica- 
tion  differ  by  304  yards  (278m).  Yet  as  shown  in  Figure  6 the  actual 
difference  between  these  two  conditions  was  only  36  yards  (33m), 
scarcely  12  percent  of  the  predicted  difference.  The  distance  judg- 
ment phase  of  the  Smith  (1958b)  study  demonstrates  that  magnification 
of  the  pictorial  array  does  exert  an  influence  on  the  perception  of 
distance,  when  magnification  is  induced  by  moving  the  viewing  point 
relative  to  the  picture.  However,  the  correspondence  between  the 
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obtained  and  predicted  judgments  is  so  slight  as  to  suggest  that  other 
factors  are  influencing  the  judgment.  Unfortunately,  the  stimulus 
conditions  are  not  specified  in  sufficient  detail  to  determine  exactly 
what  other  factors  might  have  been  operating. 

The  Smith  (1958b)  study  was  also  constructed  to  determine  the 
effects  of  magnification  and  minification  on  the  perception  of  size. 
According  to  the  analysis  of  the  virtual  space  presented  above,  the 
relative  frontal  size  of  the  object  in  the  virtual  space  is  not 
affected  by  viewing  distance  to  the  photograph.  Subjects  made  size 
judgments  by  matching  the  object  in  the  distance  with  one  of  15 
objects  in  the  foreground.  Smith  found  an  extremely  close  correspon- 
dence between  the  judged  and  actual  size  of  the  stimulus  object,  which 
was  not  affected  by  the  viewing  distance.  The  actual  size  of  the 
target  object  in  Smith's  study  was  69  inches  (175.26  cm)  high.  Sub- 
jects in  the  magnification  and  minification  conditions  made  matching 
(relative  size)  judgments,  and  selected  objects  with  mean  sizes  of 
68.8  and  68.4  inches  (174.25cm  and  173.74cm)  respectively.  Thus, 
regardless  of  the  magnification,  there  was  no  effect  on  the  perceived 
size  of  the  target  object.  It  should  be  noted,  however,  that  although 
the  data  and  predictions  regarding  size  perception  match  exactly,  the 
Smith  (1958b)  data  must  be  regarded  as  only  weak  support  of  the  propo- 
sition that  the  virtual  space  affected  the  judgment  of  size.  As  Smith 
points  out,  the  judgments  of  size  and  distance  appear  to  be  indepen- 
dent. Although  perceived  and  virtual  size  correspond,  perceived 
and  virtual  distance  do  not.  The  exact  reason  for  this  lack  of  a 
relationship  cannot  be  directly  determined,  but  it  seems  clear 


that  some  source  of  relational  information  for  size  may  have  played 
a role  in  the  judgments  (cf.  Sedgwick,  1973).  The  combination  of 
the  virtual  size  in  the  pictorial  space  with  relational  information 
could  have  resulted  in  near  prefect  judgments  since  relational 
size  is  not  affected  by  magnification. 

Effects  of  Shear 

There  are  few  experiments  on  the  effects  of  the  shear  transforma- 
tion on  pictorial  space  perception,  perhaps  because  of  the  relative 
complexity  of  these  geometrically  predicted  transformations  of  virtual 
space.  To  our  knowledge,  there  are  no  studies  of  the  effects  of  shear 
on  the  perception  of  distance  or  orientation.  The  existing  studies 
address  themselves  to  questions  of  relative  size  and  shape  perception. 

Perkins  (1972,  1973)  conducted  two  studies  relevant  to  the  question 
of  the  effects  of  the  shear  transformation  on  the  perception  of  shape. 

In  the  first,  Perkins  (1972)  had  individuals  judge  whether  drawn  paral- 
lelepipeds were  rectangular  or  nonrectangular.  Line  drawings  of  boxes 
were  constructed  such  that  half  of  them  could  not  have  been  geometric 
projections  of  rectangular  parallelepipeds.  Judgments  were  highly 
accurate.  In  general,  a line  drawing  of  a box  could  be  a representa- 
tion of  a rectangular  solid  only  when  the  drawing  could  have  been  the 
geometric  projection  of  a rectangular  object.  Perkins  suggested  that 
in  the  perception  of  shape,  the  visual  system  imposes  geometric  regu- 
larities (such  as  symmetry  and  rectangularity)  on  the  object,  but  only 
to  the  extent  that  such  regularities  are  consistent  with  the  geometric 


projection.  Thus,  the  structure  of  the  pictorial  array  sets  a limit 
upon  the  perception  of  stimulus  regularities. 
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In  a second  experiment,  Perkins  (1973)  used  the  identical 
stimulus  materials,  but  had  his  observers  view  the  pictures  obliquely 
from  angles  41°  (.72  rad)  or  26°  (.45  rad)  relative  to  the  picture 
plane.  As  in  the  previous  experiment,  the  observers'  task  was 
simply  to  judge  whether  the  represented  box  was  rectangular.  The 
specific  oblique  viewing  angles  were  chosen  to  determine  the  effect 
of  the  pictorial  array  on  the  judgment.  Under  these  viewing  con- 
ditions boxes  that  were  orthogonally  rectangular  (i.e.,  satisfied 
a qeometric  criterion  of  rectangularity  when  viewed  normally) 
could  be  either  rectangular  or  nonrectangular  in  the  virtual  space. 
Similarly,  some  of  the  orthogonally  nonrectangular  boxes  (i.e., 
those  that  would  not  meet  the  rectangularity  criterion  under  normal 
viewing)  could  be  rectangular  or  not  in  the  virtual  space.  Thus, 
Perkins  was  able  to  create  a conflict  situation  between  the  virtual 
object  specified  by  the  pictorial  array  and  the  object  represented 
in  the  drawing. 

There  are  several  noteworthy  aspects  of  the  Perkins  data.  First, 
the  accuracy  of  judgment  (defined  as  judgments  in  accordance  with 
the  orthogonal  classification,  or  in  accordance  with  the  depiction) 
was  less  for  oblique  viewing  than  for  perpendicular  viewing.  Gen- 
erally, the  transformation  of  the  virtual  shape  of  the  object  did 
affect  perceptual  judgments.  However,  this  conclusion  must  be 
qualified.  When  the  picture  was  viewed  at  41 0 (.72  rad)  from  the 
picture  plane  there  was  no  effect  of  the  virtual  space  on  judgment. 
Whether  the  virtual  object  projected  to  the  eye  was  rectangular  or 
not,  did  not  affect  judgments.  It  is  clear  then,  as  Perkins  points 
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out,  that  in  this  case  individuals  were  unaffected  by  or  able  to 
discount  the  effects  of  the  projective  transformation  of  the  virtual 
space  induced  by  the  dislocation  of  the  viewinq  point.  In  the 
26°  (.45  rad)  viewing  condition,  however,  the  conflict  between 
the  orthogonal  and  the  virtual  object  resulted  in  substantially 
inferior  performance.  In  this  condition,  there  was  clear  evidence 
that  the  perception  of  shape  was  affected  by  the  structure  of  the 
virtual  space.  Thus,  the  perception  of  shape  is  partly  determined 
by  nonoptical  factors  and  is  not  in  simple  correspondence  with  the 
virtual  space,  but  the  geometric  transformation  does  exert  some 
influence  on  judgment. 

Perceptual  Compensation  for  Geometric  Distortion 

It  is  unfortunate  that  there  is  some  inconsistency  among  studies 
performed  in  the  past.  Under  some  conditions,  space  perception  of 
transformed  pictures  seems  to  be  directly  influenced  by  the  geometry 
of  the  virtual  space  represented  by  the  pictorial  array.  Other 
research  indicates  that  factors  in  addition  to  the  simple  geometry 
of  the  virtual  space  affect  perception.  Of  course,  the  differences 
among  studies  must  be  directly  related  to  display  content  and  view- 
ing conditions.  Unfortunately,  many  of  the  early  experiments  are 
nbt  sufficiently  descriptive  to  allow  us  to  unequivocally  explain 
discrepancies  among  studies.  Some  provide  virtually  no  information 
regarding  viewing  conditions,  and  others  provide  insufficient  des- 
criptions to  allow  us  to  determine  exactly  what  information  was 
available  to  the  observer. 
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There  are,  however,  common  threads  throughout  the 
literature  which  suggest  that  there  are  certain  constraints  on  the 
operation  of  a compensation  process.  First,  it  appears  that  per- 
ception of  familiar  objects  or  objects  within  a familiar  spatial 
context  is  relatively  unaffected  by  distortion  of  virtual  space. 
Familiarity  apparently  overrides  distortions  in  the  array.  Sec- 
ondly, perception  seems  to  be  in  greater  correspondence  with  the 
virtual  space  when  picture  plane  cues  are  eliminated  or  minimized, 
or  when  the  optic  distortion  is  not  induced  by  charges  in  viewing 
point  (as  in  Purdy's  experiment  discussed  above). 

We  can  see  why  familiarity  and/or  availability  of  display  plane 
information  moderates  the  effect  of  distortion  by  analyzing  the 
nature  of  pictorial  perception.  Operations  and  states  involved 
are  represented  in  Figure  7.  In  the  simplest  case,  there  exists 
an  environment  represented  in  the  display.  If  the  display  is 
viewed  from  the  correct  point  the  transformation  is  an  isomorphic 
one,  and  virtual  space  and  environmental  space  are  identical. 
Perceived  space  corresponds  to  environmental  space  within  the 
limits  of  the  pick-up  process,  and  there  is  no  need  for  any  com- 
pensation. This  is,  of  course,  the  simplest,  most  straight-forward 
circumstance,  and  one  that  corresponds  to  one  aspect  of  the  Gib- 
son i an  position. 


The  intriguing  problem  occurs  when  we  introduce  a transforma- 
tion. How  is  it  possible  for  perceived  space  to  correspond  to  the 
environmental  rather  than  virtual  space?  One  trivial  possibility 
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is  that  familiarity  overrides  any  distortion  of  virtual  space.  We 
know  somehow  that  this  is  a picture  of  our  friend  and  see  her 
despite  the  distortion.  This  is  scarcely  perception,  of  course, 
but  more  like  social  cognition  (cf.  the  Honi  phenomenon). 

As  a second  method  of  accounting  for  "compensation  phenomena," 
we  hypothesize  a passive  perceptual  mechanism  that  renders  the 
visual  system  relatively  insensitive  to  distortions  of  virtual 
space.  There  are  at  least  two  related  ways  that  this  could  be 
passively  accomplished,  either  through  perceptual  persistence  or 
categorization.  We  have  in  mind  a situation  analogous  to  that 
reported  by  Julesz  (1971)  for  binocular  vision.  In  stereopsis, 
if  two  stereogram  halves  are  fused,  disparity  may  be  increased  well 
beyond  Panum's  area  before  diplopia  results.  In  similar  fashion, 
once  an  object,  scene,  or  spatial  relation  is  identified  or  cate- 
gorized, distortions  of  virtual  space  must  be  extreme  before  they 
affect  judgments. 

Although  he  is  not  explicit  on  this  point,  this  would  appear 
to  be  a type  of  mechanism  consistent  with  the  findings  of  Perkins 
(1973).  Perkins  suggested  that  there  is  a minimal  geometric  cri- 
terion for  rectangularity.  Consider  a two-dimensional  line  drawing 
in  which  three  lines  meet  at  a point.  For  this  configuration  to 
be  a geometrically  possible  projection  of  a rectangular  corner, 
the  three  angles  must  all  be  greater  than  90  degrees;  or  in  a 
special  case,  two  of  the  angles  must  be  exactly  90  degrees. 
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If  an  object  satisfied  this  minimal  criterion,  it  would  be 
seen  as  rectangular.  Individuals  viewed  pictures  of  boxes  from 
viewing  angles  of  41  degrees  (.72  rad)  or  26  degrees  (.45  rad) 
from  the  picture  plane.  When  the  picture  was  viewed  from  the 
25  degree  (.45  rad)  location,  judgments  were  strongly  affected 
by  the  virtual  space.  Such  results  suggest  that  for  certain 
familiar  objects  such  as  parallelepipeds , categorical  perception 
may  occur.  As  Pirenne  (1970)  has  suggested,  shifting  perceived 
shape  away  from  the  familiar  category  may  take  a substantial  dis- 
tortion of  virtual  space. 

It  seems  also  that  such  a view  v/ould  be  consistent  with  a 
recent  portion  of  Gibson's  theorizing.  He  holds  that  under  certain 
conditions  (correct  viewing?)  the  pictorial  array  can  act  as  a 
surrogate  source  of  information.  Linder  other  circumstances  (those 
in  which  projective  equivalence  is  lacking)  pictures  may  simply 
function  as  mediators  or  symbols  for  objects  or  spatial  environ- 
ments. We  also  acknowledge  that  one  aspect  of  perception  of  pic- 
torial distortion  may  simply  be  a case  of  pattern  matching  or 
categorical  judgment. 

However,  a more  interesting  aspect  of  compensation  is,  How 
might  an  unfamiliar  scene  be  treated?  It  could  be  recognized  as 
a scene  via  pattern  matching,  but  details  of  layout  are  not  given 
in  the  categorization  of  unfamiliar  landscapes.  The  observer  has 
available  only  the  virtual  scene  projected  to  the  eye.  Accurate 
perceptual  judgments  that  correspond  to  the  environmental  space 
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require  knowledge  of  the  transformation  operating  on  the  scene. 
However,  such  knowledge  is  not  available  in  the  optic  array. 

There  is  an  infinity  of  distortions  or  transformations  and  none 
of  them  can  be  determined  for  an  unfamiliar  scene  solely  on  the 
basis  of  the  virtual  space. 

If  judgments  do  not  conform  to  the  virtual  space,  but  show 
a correspondence  to  the  environmental  space,  the  transformation 
must  be  estimated  by  the  visual  system  even  though  there  may  be 
no  mathematically  sufficient  basis  for  the  estimate.  The  importance 
of  display  plane  information  suggests  that  the  ability  to  determine 
viewing  point  relative  to  the  display  affects  this  estimate. 

We  propose  as  an  explanation  of  pictorial  compensation  an 
active  perceptual  process  that  undoes  or  discounts  the  effects  of 
distortions  of  virtual  space.  There  are  several  necessary  charac- 
teristics of  such  a process.  Ideally,  such  an  active  compensation 
mechanism  would  operate  when  virtual  space  distortions  were  present, 
but  not  when  virtual  and  environmental  space  were  isomorphic.  Yet, 
as  we  have  pointed  out,  there  is  no  way  that  one  can  tell  that 
virtual  space  is  distorted  without  having  knowledge  of  the  correct 
environmental  space.  The  operation  of  an  active  compensation  pro- 
cess must  depend,  then,  on  certain  assumptions  and  inferences  about 
viewing  pictures. 

Such  an  active  process  would  operate  when  the  actual  viewing 
point  (given  by  stereopsis,  parallax,  accommodation,  etc.)  did  not 
correspond  to  the  assumed  correct  viewing  point.  Assumptions 
regarding  the  correct  viewing  location  could  be  based  on  a variety 
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of  experiential  or  optical  considerations.  For  example,  paintings, 
drawings,  and  photographs  are  often  prepared  for  viewing  from  a 
point  along  a perpendicular  to  the  center  of  the  display.  Thus, 
for  many  displays  the  center  of  projection  may  correctly  be  assumed 
to  be  along  a line  normal  to  the  display  plane. 

Distance  along  this  line  may  be  given  by  experience  or  optical 
clarity.  For  example,  standard  print  formats  have  evolved  so  as 
to  insure  that  they  subtend  approximately  the  same  angle  at  the 
standard  viewing  distances.  Approximately  the  same  distance  re- 
lations are  given  in  considering  fidelity  of  televised  pictures. 

For  standard  TV  displays,  signal  to  noise  ratios  are  highest  at 
viewing  distances  of  approximately  twice  the  height  of  the  display. 
Thus  for  both  photographic  and  electronic  displays,  presentation 
conditions  may  lead  to  an  assumed  correct  viewing  point  along  a 
line  perpendicular  to  the  center  of  the  display  and  at  a distance 
of  twice  the  display  height. 

Such  a set  of  assumptions  provides  a specification  of  the 
correct  viewing  point  in  terms  of  the  relations  to  the  picture  plane. 
It  is  important  to  point  out  certain  important  consequences  of  the 
assumptive  approach.  First,  it  is  obvious  that  such  assumptions 
are  learned  on  the  basis  of  exposure  to  specific  displays.  Chil- 
dren or  others  with  restricted  experience  may  not  have  developed 
such  abilities.  In  much  the  same  fashion,  technological  and  cul- 
tural differences  may  affect  specific  learning.  For  example,  use 
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of  a different  video  standard,  e.g.,  the  European  standard,  gives 
slightly  different  signal  to  noise  ratios  as  a function  of  distance. 
Thus  assumptions  regarding  video  displays  may  be  somewhat  altered. 
Animals  or  individuals  deprived  of  pictorial  experience  may  have 
no  learned  assumptions  about  pictorial  viewing;  their  picture  per- 
ception may  be  based  totally  on  the  virtual  space  or  on  pattern 
matching.  For  them,  space  perception  in  pictures  may  simply  be  a 
case  of  pattern  matching  of  virtual  space. 

Further,  it  is  necessary  to  acknowledge  that  the  various 
compensation  mechanisms  proposed  here  are  interrelated.  For 
example,  both  familiarity  with,  and  assumptions  about  the  environ- 
mental space  could  contribute  to  the  estimation  of  correct  viewing 
point.  If  certain  assumptions  are  made  about  what  is  depicted, 
the  range  of  potentially  correct  viewing  points  is  greatly  reduced. 
Consider  a display  containing  two  surfaces  meeting  in  a dihedral 
angle.  The  lines  from  the  viewer's  eye  to  the  two  implied  vanish- 
ing points  must  meet  at  the  same  angle  as  the  two  planes.  Thus, 
if  it  were  assumed  that  the  corner  were  rectangular,  the  possible 
correct  distance  of  the  viewing  point  would  be  constrained  to  one 
value.  Similarly,  if  it  is  assumed  that  a plane  is  perpendicular 
to  the  frontal,  then  the  correct  viewing  point  must  lie  along  a 
line  perpendicular  to  the  implied  horizon.  In  this  way,  percep- 
tual assumptions  about  the  nature  of  surfaces  and  orientations 
could  indicate  positions  for  the  correct  viewing  point.  Deviations 
from  such  an  assumed  correct  point  could  be  the  basis  for  compensa- 
tion. 
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If  compensation  is  based  on  an  assumed  correct  viewing  point 
relative  to  the  picture  plane,  the  availability  of  picture  plane 
information  should  affect  the  degree  of  compensation.  Recently,  we 
have  completed  several  experiments  in  our  laboratories  which  bear 
on  these  various  mechanisms  for  pictorial  compensation.  In  the 
first,  observers  were  required  to  make  modulus-free  magnitude  es- 
timates of  the  sizes  of  objects  depicted  in  photographs.  The 
objects  were  gray,  square  parallelepipeds  6 cm  in  height  and  vary- 
ing in  width.  The  photos  were  prepared  using  a 4 x 5 view  camera 
so  that  three  facets  of  the  object  were  visible,  and  so  that  the 
center  of  projection  was  25  cm.  away  from  the  center  of  the  photo- 
graph. 

The  photographs  were  viewed  at  distances  of  25  cm.,  50  cm., 
or  75  cm.  by  observers  whose  heads  were  held  motionless  in  a chin 
rest.  All  viewing  was  binocular,  with  the  matted  photograph  held 
in  a rectangular  frame.  The  observer's  head  was  held  in  position 
by  a chin  rest,  and  both  the  chin  rest  and  photos  were  held  in 
position  on  an  optic  bench. 

The  three  viewing  conditions  employed  in  this  study  corres- 
pond to  magnifications  of  1.0,  0.5  and  0.33.  Effects  of  such  view- 
ing conditions  on  virtual  spaces  are  easily  determined.  By  the 
rationale  presented  earlier,  minification  results  in  an  expansion 
of  virtual  space.  Thus,  the  width  of  the  object  is  affected.  The 
precise  amounts  of  geometric  expansion  under  these  three  conditions 
are  presented  in  Table  1. 
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Table  1 

Actual  Size  and  Virtual  Size  of 
Object  under  2 Minifications 


Actual  Size 

Virtual  Size 

Virtual  Size 

m = 1.0 

m = 0.5 

m = 0.33 

1.0 

1.56 

2.21 

1.5 

2.34 

3.31 

2.0 

3.14 

4.41 

2.5 

3.90 

5.52 

3.0 

4.68 

6.62 

3.5 

5.46 

7.73 

4.0 

6.24 

8.83 

4.5 

7.02 

9.94 

5.0 

7.80 

11.04 

5.5 

8.59 

12.14 

6.0 

9.37 

13.25 

6.5 

10.15 

14.35 

7.0 

10.93 

15.46 

7.5 

11.71 

16.56 

8.0 

12.49 

17.67 

3.5 

13.27 

18.77 

9.0 

14.06 

19.89 

9.5 

14.86 

21.00 

10.0 

15.61 

22.10 
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Although  the  estimates  of  the  subjects  were  scale-free,  we 
can  easily  determine  the  relationship  between  environmental,  vir- 
tual, and  perceived  space.  It  is  well  known  that  the  relation- 
ship between  magnitude  estimate  of  size  and  physical  extent  is 
linear,  i.e.,  a power  function  with  an  exponent  of  1.0,  (Marks, 
1974).  If  perceptual  compensation  for  viewing  point  occurred, 
all  functions  should  be  linear  with  a slope  of  1.0.  If  the  per- 
ceived space  matches  the  virtual  space,  we  should  expect  slopes 
equivalent  to  the  relations  in  Table  1.  That  is,  with  m = 1.0, 
the  slope  should  be  1.0;  with  m = 0.5,  the  slope  whould  be  1.40 
and  with  m = 0.33,  slope  should  be  1.99. 

The  results  of  this  experiment  are  presented  in  Figure  8.  It 
is  immediately  clear  from  the  figure  that  almost  complete  compen- 
sation occurred.  There  was  essentially  no  effect  of  the  virtual 
space  relationship  on  judgment.  Virtually  identical  results  were 
obtained  using  line  drawings  that  were  projectively  equivalent  to 
the  edges  of  the  objects  in  the  photograph.  Figure  9 depicts  those 
data.  Although  different  people  were  used,  and  consequently  a 
different  modulus  is  evident,  almost  identical  relationships  are 
obtained.  Thus  it  is  clear  that  virtually  complete  compensation 
takes  place  for  expansion  of  virtual  space  due  to  minification. 

One  difficulty  with  these  results  is  that  although  we  are 
assured  that  compensation  occurred,  v/e  have  no  means  of  determining 
its  basis.  Given  the  experimental  conditions,  any  or  all  of  the 
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mechanisms  discussed  above  could  have  been  operating.  For  example, 
the  objects  may  have  been  categorized  as  rectangular  regardless 
of  the  virtual  space,  and  observers'  estimates  based  on  shape  re- 
lations. Or  an  assumption  about  the  correct  viewing  point  could 

t ) 

have  been  made  based  on  picture  plane  location  or  on  vanishing 
point  relations.  In  order  to  avoid  such  difficulties,  and  to 
determine  more  precisely  the  extent  and  nature  of  pictorial  com- 
pensation we  have  undertaken  several  other  experiments  examining 
other  indicators  of  perceptual  accuracy. 

One  consequence  of  the  dilation/compensation  of  space  which 
results  from  changes  in  viewing  point  involves  changes  in  virtual 
orientation.  Magnification  makes  the  virtual  orientation  of  sur- 
faces more  frontal,  minification  makes  it  less  frontal.  Thus  to 
evaluate  the  nature  of  compensation  for  distortion,  we  have  used 
observer's  direct  scale  estimates  of  orientation  of  lattices.  In 
our  studies  of  magnifications,  slanted  lattices  were  computer- 
generated and  displayed  on  a CRT.  Observers  viewed  the  display 
binocularly  and  made  direct  estimates  of  orientation  in  degrees 
and  expressed  their  response  on  a computer  keyboard.  The  conven- 
tion was  adopted  with  90  degree  being  the  frontal,  and  orientations 

I 

with  the  top  edge  further  away  were  denoted  as  less  than  90.  In 
the  first  experiment,  observers  always  viewed  the  display  from 
112  cm.;  however,  the  correct  center  of  proportion  across  conditions 
was  28,  56,  84,  112,  225,  337,  and  450  cm.  Thus  7 degrees  of  magni- 
fication (m  = 4.0  to  m = 0.25)  were  induced.  The  virtual  orientation 


I 


Perception  of  Pictured  Space 
45 


*J 


of  surfaces  under  these  magnification/minification  ratios  is  depicted 
in  Figures  10  and  11.  No  independent  information  for  the  slant  of 
the  lattice  was  available,  and  the  distance  between  viewer  and  dis- 
play was  constant.  Consequently,  the  exact  nature  of  the  transfor- 
mation and  its  degree  is  not  specified  optically.  Under  such  condi- 
tions, we  should  expect  that  judgments  should  be  determined  by  and 
in  correspondence  with  virtual  space,  at  least  within  limits  im- 
posed by  familiar  size  and  shape. 

The  data  for  this  experiment  are  depicted  in  Figure  12  for 
magnifications,  and  in  Figure  13  for  minifications.  It  is  evident 
from  comparing  these  figures  that  the  range  of  judgment  is  restricted. 
As  has  been  verified  in  numerous  experiments,  slant  judgments  in  a 
variety  of  experimental  circumstances  tend  to  be  more  frontal  than 
is  specified  by  the  virtual  space.  The  reduced  range  of  judgment  in 
the  present  case  is  probably  the  result  of  the  conflict  between 
monocular  perception  and  binocular  information  for  the  display  plane. 
However,  there  is  no  doubt  that,  within  the  general  accuracy  con- 
straints for  slant  perception,  virtual  space  exerts  a significant 
influence  on  perception.  For  the  most  extreme  minification  of  .25, 
the  relation  between  judgments  and  physical  slant  is  primarily 
linear  with  a slope  of  approximately  .8.  For  the  most  extreme  mag- 
nification, the  relationship  is  cubic.  It  is  worth  pointing  out 
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These  results  suggest  that  the  compensation  for  display  plane 
orientation  occurs  when  the  viewing  angle  from  the  eye  to  the  dis- 
play varies  from  some  system  reference  value.  Such  a set  of 
picture  viewing  assumptions  would  account  for  the  operation  of  a 
compensation  process.  We  propose  that  the  process  itself  is  a 
relatively  simple  one  that  discounts  the  effects  of  viewing  point 
dislocations.  Consider  a situation,  like  that  described  above,  in 
which  an  individual  makes  judgments  of  the  orientation  of  a surface 
while  viewing  a display  that  is  slanted  relative  to  the  line  of 
sight.  Since  9'  is  specified  by  surface  gradients,  and  5 by  binocu- 
lar vision  or  frame  perspective,  exact  correspondences  for  9 could 
be  determined  using  the  relationship  defined  in  the  equation  above. 

Although  such  a process  may  appear  rather  complex,  a function 
of  the  sort  described  above  is  necessary.  In  our  experiment, 
virtual  orientation  was  a curvilinear  function  of  environmental 
orientation  when  the  display  plane  is  viewed  at  a slant.  Yet  for 
both  the  45°  (.79  rad)  and  135°  (2.36  rad)  viewing  angles,  judg- 
ments of  orientation  were  coll  inear  without  higher  order  components. 
This  indicates  that  the  compensation  process  is  a non-linear  one, 
and  also  indicates  that  simple  differences  between  9 and  6 can  not 
be  the  basis  of  compensation. 

The  present  hypothesis  can  also  account  for  the  paradox  of 
picture-in-a-picture  perception  (Pirenne,  1970).  Consider  a slanted 
surfacp  viewed  directly,  with  a horizontal  line  of  regard,  then 
surface  slant,  9,  equals  projected  slant,  9'.  If  a photograph  of 
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the  surface  is  viewed  from  the  wrong  station  point,  projected  slant, 
0',  is  a function  of  surface  slant,  0,  and  angle  6 as  given  by  the 
above  equation.  If  this  photograph  is  itself  photographed  and  the 
second  photo  is  viewed  from  the  wrong  station  point,  a second  trans- 
formation occurs,  and  projected  slant,  0",  is  a function  of  0‘,  0, 

6,  and  5'.  The  paradox  results  from  the  fact  that  it  is  apparently 
difficult  to  compensate  for  the  transformation  while  viewing  the 
second  photo.  If  the  second  photograph  is  viewed  from  its  correct 
station  point,  a situation  geometrically  equivalent  to  that  in  the 
above  experiment  is  obtained,  and  compensation  could  only  occur  if 
6 ( the  inclination  of  the  first  photograph  relative  to  the  optical 
axis  of  the  camera  taking  the  second  picture)  is  known  (see  Pirenne's 
Figures  8.1  and  8.2  in  this  regard).  If  the  second  photograph  is 
viewed  from  the  wrong  station  point,  two  transformations  occur 
and  compensation  (accurate  perception  of  0)  requires  accurate 
estimates  of  0",  0',  and  6',  and  6.  Consequently,  errors  in  estimat- 
ing any  of  these  parameters  will  result  in  errors  of  compensation. 
Additionally,  since  two  transformations  involving  nested  calcula- 
tions are  involved,  such  errors  will  be  exaggerated.  Further,  since 
binocular  or  monocular  cues  do  not  specify  the  orientation  of  the 
first  picture  relative  to  the  second  or  relative  to  the  observer, 
an  estimate  of  6 may  be  unavailable  to  the  observer. 

A similar  process  can  be  proposed  based  on  the  assumed  content 
of  the  display  rather  than  the  assumed  viewing  relations.  For  a 
familiar  object,  the  discrepancy  between  0'  (registered)  and  0 
(assumed)  is  an  indicator  of  6.  This  value  of  6 can  then  be  used 
in  applying  compensation  processes  to  other  objects  and  surfaces 
depicted  in  the  display.  There  is  evidence  that  such  assumptions 


I 


Perception  of  Pictured  Space 
52 


v 

i 

regarding  context  can  affect  compensation.  Cooper  (unpublished) 
required  adults  to  judge  rectangularity  of  parallelepipeds  which 
were  distorted  in  virtual  space  by  a shear.  When  the  object  was 
included  in  the  display  so. that  its  edges  and  sides  were  parallel 
to  the  corners  and  walls  of  the  room  (also  in  the  display),  judg- 
ments were  essentially  perfect.  When  the  position  of  the  target 
was  altered  so  that  edges  and  sides  were  not  parallel  to  the  room's 
corners  and  walls,  judgmental  accuracy  decreased  to  70%.  The  rela- 
tionship between  the  distorted  objects  and  the  reference  axes  pro- 
vided by  the  assumed  rectangularity  of  the  room  provided  a way  to 
discount  distortions  of  virtual  space. 

In  summary,  the  lack  of  a strict  correspondence  between  perceived 
and  virtual  space,  coupled  with  the  existence  of  correspondences 
between  environmental  and  perceived  space  suggest  that  pictorial 
perception  involves  a unique  visual  compensation  process.  We  pro- 
pose two  possible  mechanisms  that  may  be  involved  in  such  compensa- 
tion. First,  a simple,  passive  pattern  matching  may  be  involved. 

Once  an  object  or  pattern  meets  certain  minimal  criteria,  categori- 
zation occurs.  Extreme  distortions  are  necessary  before  such  cate- 
gorization can  be  overcome. 

Secondly,  we  propose  the  existence  of  an  active  perceptual 
compensation  process  that  discounts  optical  distortions  caused  by 
changes  in  the  viewing  point.  Since  there  is  no  optical  information 
for  the  correctness  of  viewing  point,  such  compensation  must  be 
based  on  assumption  regarding  appropriate  viewing  conditions  for 
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representational  displays.  The  research  that  we  have  conducted 
indicates  that  observers  can  compensate  for  both  the  effects  of 
shear  and  of  magnification  on  spatial  layout.  Our  data  suggest 
that  compensation  is  a non-linear  process  modulated  by  the  dif- 
ference between  actual  and  assumed  correct  viewing  points.  The 
actual  location  of  these  assumed  correct  points,  and  the  means  by 
which  such  assumptions  develop  are  clearly  two  important  issues 
for  future  work  in  pictorial  space  perception. 
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